12 research outputs found

A study on text-score disagreement in online reviews

Author: A Flanagin
A Ghose
A Hotho
A Muhammad
Angelo Spognardi
B Agarwal
BA Sparks
C Cortes
E Cambria
F Bravo-Marquez
HA Schwartz
IE Vermeulen
J Hipp
JR Quinlan
M-T Martín-Valdivia
Marinella Petrocchi
Michela Fazzolari
O Netzer
P Green
Q Zhou
R Pandarachalil
S Poria
SL Lo
T Wilson
TM Mitchell
Vittoria Cozza
W Medhat
X Fang
Y Xia
Z Bu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017

In this paper, we focus on online reviews and employ artificial intelligence tools, taken from the cognitive computing field, to help understand the relationships between the textual part of a review and its assigned numerical score. We start from two intuitions: 1) a set of textual reviews expressing different sentiments may feature the same score (and vice versa); and 2) detecting and analyzing the mismatches between the review content and the actual score may benefit both service providers and consumers, by highlighting specific factors of satisfaction (and dissatisfaction) in texts. To test these intuitions, we adopt sentiment analysis techniques and concentrate on hotel reviews, looking for polarity mismatches in them. In particular, we first train a text classifier on a set of annotated hotel reviews taken from the Booking website. Then, we analyze a large dataset of around 160k hotel reviews collected from Tripadvisor, with the aim of detecting polarity mismatches, that is, of indicating whether the textual content of a review is in line with the associated score. Using well-established artificial intelligence techniques and analyzing in depth the reviews featuring a mismatch between text polarity and score, we find that, on a scale of five stars, reviews with middle scores include a mixture of positive and negative aspects. The proposed approach, besides acting as a polarity detector, provides an effective selection of reviews from an initially very large dataset. This may allow both consumers and providers to focus directly on the subset of reviews featuring a text/score disagreement, which conveniently conveys to the user a summary of the positive and negative features of the review target.

Comment: This is the accepted version of the paper. The final version will be published in the Journal of Cognitive Computation, available at Springer via http://dx.doi.org/10.1007/s12559-017-9496-
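The text/score mismatch check described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the small word lists stand in for the trained text classifier, and the function names and the star thresholds (1-2 negative, 3 neutral, 4-5 positive) are assumptions made only for this example.

```python
# Hypothetical word lists standing in for a trained polarity classifier.
POSITIVE = {"clean", "friendly", "great", "comfortable", "excellent"}
NEGATIVE = {"dirty", "rude", "noisy", "broken", "terrible"}


def text_polarity(text):
    """Return 'positive', 'negative', or 'neutral' from simple word counts."""
    words = [w.strip(".,!?;:").lower() for w in text.split()]
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    if pos > neg:
        return "positive"
    if neg > pos:
        return "negative"
    return "neutral"


def score_polarity(stars):
    """Map a 1-5 star score to a polarity label (assumed thresholds)."""
    if stars >= 4:
        return "positive"
    if stars <= 2:
        return "negative"
    return "neutral"


def find_mismatches(reviews):
    """Keep the (text, stars) pairs whose text polarity differs from the score polarity."""
    return [(t, s) for t, s in reviews if text_polarity(t) != score_polarity(s)]


# Made-up reviews, used only to exercise the functions.
reviews = [
    ("Great location and friendly staff.", 5),
    ("The room was dirty and the staff were rude.", 4),
    ("Clean room but noisy street.", 3),
]
mismatches = find_mismatches(reviews)
```

Here only the second review is flagged, because its text reads as negative while its score is high. In practice the word-count function would be replaced by a classifier trained on annotated reviews.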

arXiv.org e-Print Archive

Crossref

Catalogo dei prodotti della ricerca

Archivio della ricerca- Università di Roma La Sapienza

Online Research Database In Technology

Archivio istituzionale della ricerca - Università di Padova

Implicit location sharing detection in social media Turkish text messaging

Author: KG Shin
O Ajao
R Pandarachalil
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

2nd International Workshop on Machine Learning, Optimization and Big Data (2016 : Volterra; Italy)

Social media have become a significant venue for sharing live updates. Users of social media produce and share large amounts of personal data as part of these updates. A significant share of this data contains location information that other people can use for many purposes. Some social media users deliberately share their location with other users. However, a large number of users share their location implicitly, without noticing it or its possible consequences. This paper investigates implicit location sharing. We perform a large-scale study on detecting implicit location sharing on one of the most popular social media platforms, namely Twitter. After a careful study, we prepared a training data set of Turkish tweets and labelled them manually. Using machine learning techniques, we induced classifiers that determine whether a given tweet contains implicit location sharing. The classifiers are shown to be very accurate and efficient. Moreover, the best classifier is employed in a browser add-on tool that warns the user whenever implicit location sharing is predicted in a tweet that is about to be posted. The paper describes the methodology followed and provides the technical analysis. Furthermore, it discusses how these techniques can be extended to other social network services and to other languages. © Springer International Publishing AG 2016

Crossref

TOBB ETÜ Institutional Repository

Classification of multi-lingual tweets, into multi-class model using Naïve Bayes and semi-supervised learning

Author: AAA Essam Kazem Al-Yasiri
Ayaz H. Khan
B Gupta
C Hong
J Leskovec
J Yu
M Bilal
M Hasan
Muhammad Zubair
R Pandarachalil
SM Harshita Mandloi
YH Liu
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Accelerating Infinite Ensemble of Clustering by Pivot Features

Author: Amir Hussain
CM Bishop
D Achlioptas
E Bingham
GE Hinton
Guo-Sen Xie
K Bache
Kaizhu Huang
L Breiman
M Brun
M Jiu
R Filipovych
R Pandarachalil
S Ding
S Vega-Pons
V Estivill-Castro
X Li
X-B Jin
Xiao-Bo Jin
Y Bengio
Y Lecun
Publication venue: BMC
Publication date: 01/12/2018

Infinite ensemble clustering (IEC) incorporates both ensemble clustering and representation learning by fusing infinite basic partitions, and shows appealing performance in the unsupervised context. However, it needs to solve a linear equation system with a high time complexity of O(d^3), where d is the concatenated dimension of many clustering results. Inspired by the cognitive characteristic of human memory, which can pay attention to the pivot features in a more compressed data space, we propose an accelerated version of IEC (AIEC) that extracts the pivot features and learns multiple mappings to reconstruct them, so that the linear equation system can be solved with time complexity O(dr^2) (r ≪ d). Experimental results on standard datasets, including image and text ones, show that our algorithm AIEC greatly improves the running time of IEC while achieving comparable clustering performance.

Crossref

Repository@Napier

Echo state networks (ESNs), belonging to the wider family of reservoir computing methods, are a powerful tool for the analysis of dynamic data. In an ESN, the input signal is fed to a fixed (possibly large) pool of interconnected neurons, whose state is then read by an adaptable layer to provide the output. This last layer is generally trained via a regularized linear least-squares procedure. In this paper, we consider the more complex problem of training an ESN for classification problems in a semi-supervised setting, wherein only a part of the input sequences are effectively labeled with the desired response. To solve the problem, we combine the standard ESN with a semi-supervised support vector machine (S3VM) for training its adaptable connections. Additionally, we propose a novel algorithm for solving the resulting non-convex optimization problem, hinging on a series of successive approximations of the original problem. The resulting procedure is highly customizable and also admits a principled way of parallelizing training over multiple processors/computers. An extensive set of experimental evaluations on audio classification tasks supports the presented semi-supervised ESN as a practical tool for dynamic problems requiring the analysis of partially labeled data
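The standard ESN described at the start of the abstract above (a fixed random reservoir read by a layer trained with regularized linear least squares) can be sketched in a few lines. This is a toy illustration under stated assumptions, not the paper's method: it leaves out the semi-supervised S3VM training entirely, and the reservoir size, weight ranges, ridge value, next-value prediction task, and all function names are choices made only for this example.

```python
import math
import random


def make_reservoir(n_units, scale, rng):
    """Fixed random input and recurrent weights; these are never trained."""
    w_in = [rng.uniform(-1.0, 1.0) for _ in range(n_units)]
    w_res = [[rng.uniform(-scale, scale) for _ in range(n_units)]
             for _ in range(n_units)]
    return w_in, w_res


def run_reservoir(inputs, w_in, w_res):
    """Feed a scalar input sequence through the reservoir and collect its states."""
    n = len(w_in)
    state = [0.0] * n
    states = []
    for u in inputs:
        # Every new unit value is computed from the previous state vector.
        state = [
            math.tanh(w_in[i] * u + sum(w_res[i][j] * state[j] for j in range(n)))
            for i in range(n)
        ]
        states.append(state)
    return states


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def train_readout(states, targets, ridge):
    """Ridge regression: solve (S^T S + ridge * I) w = S^T y for the readout weights."""
    n = len(states[0])
    gram = [[sum(s[i] * s[j] for s in states) + (ridge if i == j else 0.0)
             for j in range(n)] for i in range(n)]
    rhs = [sum(s[i] * y for s, y in zip(states, targets)) for i in range(n)]
    return solve(gram, rhs)


rng = random.Random(0)
w_in, w_res = make_reservoir(n_units=8, scale=0.1, rng=rng)
inputs = [math.sin(0.3 * t) for t in range(60)]
targets = inputs[1:] + [math.sin(0.3 * 60)]  # toy task: predict the next input value
states = run_reservoir(inputs, w_in, w_res)
w_out = train_readout(states, targets, ridge=1e-3)
predictions = [sum(w * s for w, s in zip(w_out, st)) for st in states]
mse = sum((p - y) ** 2 for p, y in zip(predictions, targets)) / len(targets)
```

Only `w_out` is learned; the reservoir weights stay fixed after initialization, which is what keeps the training step a linear problem.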

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Microblog sentiment analysis using social and topic context

Author: Anil Bandhakavi
BW Kernighan
D Easley
E Cambria
E Hatfield
F Ren
F Wu
Farhan Hassan Khan
Feng Xia
G Palla
Giuliana Carullo
J Bollen
J Friedman
J Ortigosa-Hernández
Jianpei Zhang
Jing Yang
Miller Mcpherson
MO Jackson
N Godbole
N Simon
R Pandarachalil
S Kiritchenko
S Wasserman
Shuyuan Deng
Tao Chen
Xianghua Fu
Xiaomei Zou
Y Sun
Yan Bo Xie
Publication venue: 'Public Library of Science (PLoS)'

Crossref

Semi-supervised Echo State Networks for Audio Classification

Author: AJ Eronen
Aurelio Uncini
B Meftah
B Zhang
D Bacciu
D Barchiesi
D Li
D Stowell
D Verstraeten
E Trentin
F Facchinei
F Triefenbach
FM Bianchi
G Fung
G Scutari
G Tzanetakis
H Jaeger
IB Yildiz
J Beltrán
J Zhao
JC Castillo
K Vandoorne
M Belkin
M Lukoševičius
MH Tong
MM Adankon
O Chapelle
P Campolucci
P Di Lorenzo
R Pandarachalil
R Rifkin
S Hochreiter
S Scardapane
Simone Scardapane
SP Chatzis
W Maass
X Dutoit
X Lin
X Zhu
YF Li
Z Fu
Z Shi
ZK Malik
Publication venue: 'Springer Science and Business Media LLC'

Crossref

The Impact of Sentiment Features on the Sentiment Polarity Classification in Persian Reviews

Author: A AleAhmad
A Balahur
A Famian
A Montejo-Ráez
A Neviarouskaya
AK Jain
AK Uysal
B Agarwal
B Liu
C Catal
C Chu
C Hung
C Liao
D Fragoudis
D Gao
D Tang
D Vilares
DR Recupero
E Boiy
E Cambria
Ehsan Asgarian
F Sebastiani
FH Mahyoub
FL Cruz
G Forman
G Wang
HK Aldayel
I Dehdarbehbahani
I Habernal
J Steinberger
M Rushdi Saleh
M Taboada
M-T Martín-Valdivia
ME Basiri
ML Bermingham
Mohsen Kahani
N Ofek
N Oliveira
N Taghizadeh
O Appel
Q-F Wang
R Dehkharghani
R Duwairi
R Pandarachalil
R-E Fan
S Ali-Mardani
S Poria
Shahla Sharifi
Z Zheng
Publication venue: 'Springer Science and Business Media LLC'

Crossref